CDS

Accession Number TCMCG075C22678
gbkey CDS
Protein Id XP_017980106.1
Location join(16990747..16990986,16991345..16991615,16991738..16992110,16992218..16992436,16992558..16992699,16993006..16993254,16993378..16993671)
Gene LOC18594730
GeneID 18594730
Organism Theobroma cacao

Protein

Length 595aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018124617.1
Definition PREDICTED: probable terpene synthase 6 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description Terpene synthase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R07648        [VIEW IN KEGG]
R09614        [VIEW IN KEGG]
R10598        [VIEW IN KEGG]
R10599        [VIEW IN KEGG]
KEGG_rclass RC02425        [VIEW IN KEGG]
RC02581        [VIEW IN KEGG]
RC03207        [VIEW IN KEGG]
RC03208        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K15799        [VIEW IN KEGG]
ko:K15803        [VIEW IN KEGG]
ko:K22467        [VIEW IN KEGG]
EC 4.2.3.195        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
4.2.3.69        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
4.2.3.75        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
4.2.3.78        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
4.2.3.79        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00909        [VIEW IN KEGG]
map00909        [VIEW IN KEGG]
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0016020        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGCACTCCAAGCCAGTATTTTCACTAAGTCCTGCTTCCACCGAGGCGTTTATATGCCAACTCTTCCGTCCAAATTTCGGGGGAGACAATGCTTGGCTTCCACAAACTCAATGGCTGCAGCAGCCCTGCAAGGCTCTCCGCTGTCGGCAAACACTCACCAAGAAGTCTTTCGTCCTTTGGCAGACTTCCCCCCGGATATATGGGGCGACTGTTTCATGTCTCTCTCGCTCGATAACTTGGAATTTGAATCACTCTGTAGACAAGTAGAGGTGTTGAAAAAGAAGGTGAAGGGCATGTTATCGGCTCCGAGTGACCAAGTAGAAAAAATCCTCTTGATCAACTCTCTATGCCGCCTTGGATTATCGTACCACTTTGAGAATGAGATTGAAGAGCAATTAAGTTACCTTTTTGTTTCATTATCTAAACATATGGATGATAAAGACTATGACTTGGAAACAGTTGCAGCGATATTTCAAGTTTTCAGGCTACATGGTTATCGAATGCGCTGTGATGTGTTTAACAAGTTTAAGGAGGGTGATGGTGAGTTCAAGGAAGTGTTAGCTAGTGATGTCAAGGGCATCCTAAGCTTGTATGAAGCTAGCCAGTTCAGAATAAATGGCGAGAAAATTTTAGATGAAGCCCTTGCTTTCACAACGAAGCACTTAGAGTCCTTGACAGACCAATCAAGTCCCCATCTTAGAGAATACATAGGAAACGCTTTGAACCGACCTTATCACAAAGGCATGCCGAGAGTGGAAGCAAGGCAATATATAACTTTCTATGAAAAAGAAGAATCGCCCAATGAAACATTGCTCAAGCTCGCAAAATATGATTTTAACCGAGTCCAATTTCTACACCAGCAAGAATTAAGCATCCTTTCGAGTTGGTCCAAAGACTTGAACATAGCATCACAACTTTCTTACGCCAGAAACAGAACGGTGGAGATCTTTTTTTGGACAGTTGGATTTTATTTCGAGCCACGTTATGCACTTGCCCGAAACATATTCACTAAGCTGCTGATCATCTTAGGATTTATAGATGATACCTATGACGCATATGGTACCTTCGAAGAACTCCAATGTTTCACAGATGCAATACAAAGGTGGGATATTAGTGCCCTTGATCAGCTGCCCGCAGATTATTTGAAATTTCTTTATGGGGCACTCCTTAATGTTTATGATGAAGTGGATAGAATGGTGAGCATGGATGGGAGATGTTACAGCATGTCTTTTACCAAAGATGAGTTGAAGAAAATTGTTATTTCCTACCTGGTTGAAGCTCAGTGGACGCATGAAGGTTATATGCCAACATTCGATGAGTATTTGGACATTGCATTACATTCAAGTGCAGCCATTCTAGTGATTGCTGAAGTCTTGGTCGGAATGGAAGAAGCAGATGCCAATGTTTTTGAATGGTTGAGACAAGGTGACAGTAAATCTCTTGCAGCAATAAAAATAATTGGCCGTCTCTATGATGACATCGCAACCAATGAGGATGAGGAAAAGAGAGGACTAGTTGCTTGTGGAATCAAATGTTATATGAAGCAATATGGCGTTTCAAAGGAAGAAGCTATTGAAGAATTTCGAAAAAGACTTGTCATTGCTTGGAATGAGCTTAATGAAGATCATATGAGGCCAACGACTGTCCCAATGGAAATCCTTAATCGTGTTCGTAACATTGCATGTGTAATAGATCTTACATACAAGGATGAGGATGGATTTACCATGTCTGAGAAAATTTTGAAAGACCACATTACCAAAGTGCTCATTGAGCCCATTCCTATTTGA
Protein:  
MALQASIFTKSCFHRGVYMPTLPSKFRGRQCLASTNSMAAAALQGSPLSANTHQEVFRPLADFPPDIWGDCFMSLSLDNLEFESLCRQVEVLKKKVKGMLSAPSDQVEKILLINSLCRLGLSYHFENEIEEQLSYLFVSLSKHMDDKDYDLETVAAIFQVFRLHGYRMRCDVFNKFKEGDGEFKEVLASDVKGILSLYEASQFRINGEKILDEALAFTTKHLESLTDQSSPHLREYIGNALNRPYHKGMPRVEARQYITFYEKEESPNETLLKLAKYDFNRVQFLHQQELSILSSWSKDLNIASQLSYARNRTVEIFFWTVGFYFEPRYALARNIFTKLLIILGFIDDTYDAYGTFEELQCFTDAIQRWDISALDQLPADYLKFLYGALLNVYDEVDRMVSMDGRCYSMSFTKDELKKIVISYLVEAQWTHEGYMPTFDEYLDIALHSSAAILVIAEVLVGMEEADANVFEWLRQGDSKSLAAIKIIGRLYDDIATNEDEEKRGLVACGIKCYMKQYGVSKEEAIEEFRKRLVIAWNELNEDHMRPTTVPMEILNRVRNIACVIDLTYKDEDGFTMSEKILKDHITKVLIEPIPI